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ABSTRACT 

In cosmological structure formation models, massive non-linear objects in the process of formation, such 
as galaxy clusters, are surrounded by large-scale shocks at or around the expected virial radius. Direct ob- 
servational evidence for such virial shocks is currently lacking, but we show here that their presence can be 
inferred from future, high resolution, high-sensitivity observations of the Sunyaev-Zel'dovich (SZ) effect in 
galaxy clusters. We study the detectability of virial shocks in mock SZ maps, using simple models of cluster 
structure (gas density and temperature distributions) and noise (background and foreground galaxy clusters 
projected along the line of sight, as well as the cosmic microwave background anisotropics). We find that at an 
angular resolution of 2" and sensitivity of 10/iK, expected to be reached at ^ 100 GHz frequencies in a 20hr 
integration with the forthcoming ALMA instrument, virial shocks associated with massive (M ~ 10'^ Mq) 
clusters will stand out from the noise, and can be detected at high significance. More generally, our results im- 
ply that the projected SZ surface brightness profile in future, high-resolution experiments will provide sensitive 
constraints on the density profile of cluster gas. 

Subject headings: cosmology: theory - cosmology: observations - large scale structure of universe - cosmic 
microwave background - galaxies: clusters: general 



1. INTRODUCTION 

In cosmological theories of structure formation, non-linear 
objects form when overdense dark matter perturbations turn 
around, col lapse, and settle into virial equilibrium (e.g. 
iPeeblesI 11993 and references therein). Gas initially col- 
lapses together with the dark matter, but eventually encounters 
nearly stationary material that had akeady collapsed. Since 
the gas is falling in at supersonic velocities, it is slowed down 
by hydrodynamical shocks, and these shocks are thought to 
heat the gas to the virial temperature of the dark matter halo. 

In spherically symmetric models, and in the absence of 
dissipation, a single strong gaseous shock occurs at approx- 
imately half of the turn-around radius (Bertschinger 1985), 
coinciding with the "virial radius" of the dark matter halo. 
More realistically, the behavior o f the post-shock g as depends 
sensitively on its cooling time jRees & Ostrikedli977 ). On 
galactic scales (M ^ 10'^ Mq) and below, and increasingly 
toward high redshifts (z > 3), the gas can cool rapidly and 
loose its pressure support, and hence continue its infall. On 
these scales, the existence of large-scale shocks have been re- 
cently called into question by models in which the bulk of the 
infalling gas remains cold, and reaches th e centra l regions of 
the dark halo before enc ountering shocks (iBirnboi m &Deke3 
iKeres et aP 120041) . On larger scales, however, where 
cooling times are long, such as for galaxy clusters, the ex- 
istence of virial shocks remains an unambiguous prediction 
of cosmological structure formation theories. Detailed three- 
dimensional hydrody namical simulations of cluster formation 
(e.g. JEvrard.l99a: iBrvan & Normanl[T998h have confirmed 



the existence of virial shocks, with strong discontinuities in 
gas density and temperature. These and subsequent simula- 
tions have also revealed that the infall is anisotropic, with gas 
falling in to the cluster potential along cosmic filaments. As 
a result, the radial location and strength of the shocks varies 
along different directions. 

The virial shocks are a fundamental ingredient of cos- 
mic structure formation, and may be responsible for diverse 
phenomenae, such as generating large-scale cosmic mag- 
netic fields (Bagchietal. 2002) and accelerating electrons 
to contribute to t he diff use cosmic gamma-ray background 
(Loeb & Waxmanl l200(]l) . The radial location of the shocks, 
in principle, also contains information on the cosmological 
parameters (Verde et al. 2002). Despite their importance, di- 
rect evidence for the existence of such shocks does not yet 
exist. The major difficulty in observing the virial shock is 
that it is expected to lie several Mpc (and several arcminutes) 
away from the cluster center, a location at which signals such 
as the X-ray surface brightness (Tozzi et al. 2000), or galaxy 
number dens ity/peculiar veloc ities (which could reveal den- 
sity caustics. iRines et al.l2003h diminish rapidly. 

In this paper, we consider the detectability of virial shocks 
in future observations of galaxy clusters through the Sunyaev- 
Zel'dovich (SZ) effect. The thermal SZ effect is a secondary 
distortion of the cosmic microwave background (CMB) spec- 
trum caused by the hot intra-cluster gas along the line of sight 
to the surface of last scattering (see Sunyaev & Zeldovic^ 
for a review). The cool CMB photons undergo inverse 
Compton scattering on the hot electrons, gaining on average 
a small amount of energy in the process, creating an intensity 
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decrement at low frequencies (i/ < 218GHz) and an increment 
at high frequencies. The SZ effect is the dominant source of 
CMB anisotropy at small angular scales. 

The SZ effect has rece ntly become a valuable observational 
tool (Birkinshaw 1999). Several programs have begun to 
map out massive clusters of galaxies, study the intracluster 
medium (ICM), and constrain cosmological parameters. Cur- 
rent instruments are now detecting and imaging clusters at 
high signal-to-noise, and the next generation of instruments 
should be capable of mapping significant portion s of the sky 
as a means of finding clusters of galaxies (see Carlstrom et alJ 
I2OO2 for a review). Several studies have predicted the number 
of clusters that could be detected in future SZ surveys fBartl^ 
pool; Holder etal. 2000; Haiman et al. 2001; Barbosa et aQ 
fl996t iKneissl et al. 2001). The survey yields are quite im- 
pressive. Next generation instruments, such as the Atacama 
Cosmology Telescope (ACT), South Pole Telescope (SPT), 
and the Planck satellite', are expected to detect several clus- 
ters per day; the large resulting samples can be used to select 
the most massive and most regular clusters that will be best 
suited for the studies proposed here. 

The SZ effect is ideally suited to study the "outskirts" of 
clusters, because the SZ temperature decrement profile is rel- 
atively flat (e.g. AT cx p, whereas the X-r ay emission is pro- 
portional to the square of the local density; Komatsu & Seliak 
(|2001)). Although our main focus is to assess the significance 
at which the shocks can be detected, we also consider the 
more general problem of constraining the cluster gas distri- 
bution, as well as the structure of the dark matter halos them- 
selves. 

The detection of sharp features, such as the virial shocks, 
calls for high sensitivity, high-resolution maps of the SZ sur- 
face brightness profile of the most massive clusters. For this 
reason, we here focus on predictions appropriate for the At- 
acama Large Millimeter Array (ALMA)^, a telescope array 
expected to be operational in 2012 and deliver ^arcsecond 
resolution, high-sensitivity imaging of clusters. Our results 
can be scaled to apply to other instruments with different pa- 
rameters. 

This paper is organized as follows. In §|5] we summarize 
the relevant characteristics of ALMA. In §|3] we describe our 
models for the structure of galaxy clusters. These models are 
based on standard descriptions of gas in hydrostatic equilib- 
rium with a dark matter halo, except that we introduce addi- 
tional free parameters that allow us to vary the location and 
sharpness of the 'virial shock. In § |4] we compute and con- 
trast the SZ surface brightness profiles in models with differ- 
ent virial shocks. In §0 we discuss the sources of noise in 
the SZ surface brightness maps. In § |6l we present simple 
estimates to argue that, in the face of noise, smooth cluster 
profiles can be distinguished at high significance from pro- 
files that include a virial shock. In §0 we go one step further, 
and compute the statistical accuracy at which the location and 
sharpness of the shock fronts, as well as other parameters de- 
scribing the cluster profile, can be determined from future SZ 
maps. Finally, in §|8]we summarize our conclusions and dis- 
cuss the implications of this work. Several technical points, 
related to our statistical analysis, are discussed in the Appen- 
dices. 

Throughout this paper we assume a standard cold-dark 

' See www.hep.upenn.edu/act, astro.uchicago.edu/spt, and 
www.rssd.esa.int/index. php?project=PLANCK, respectively. 
^ See www.alma.nrao.edu. 
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matter cosmology (ACDM), with (Ua, ^m, ^h. Ho) = (0.7, 
0.3, 0.045, 70 km s~' Mpc~'), consist ent with the recent re- 
sults from WMAP JSper^el et al.l2003l) . 

2. ALMA 

The three crucial characteristics of any instrument for the 
detection of the virial shocks are angular resolution (needed 
since the features caused by the shocks are sharp), angular 
coverage (needed since the features can extend coherently 
over arcminute scales), and sensitivity (the surface bright- 
ness change across the shock-front is as small as a few /^K). 
ALMA has an ideal combination of these three qualities, and 
is well suited to studying the gas density cutoff. ALMA will 
be comprised of sixty four 12-meter sub-millimeter quality 
antennae, with baselines extending from 150 m up to 10 km. 
Its receivers will cover the range from v = 70 to 900 GHz. 
Anticipated SZ temperature sensitivity for a Af, « 60.? in- 
tegration period is between ao ~ 100/iK and 24mK for the 
compact configuration depending on the frequency of the de- 
tection. 

The expected aperture efficiency of an antenna has been 
taken from Butler et al. ( 1999). Generally, a baseline B corre- 
sponds to a beam radius 

A0=-=2"xf^— )"7-^V\ (1) 
^ lyB VlOOGHz/ V300my 

The resulting temperature sensitivities for the compact con- 
figuration (B = 150m) is given in TablelU^ In our calculations 
below, we will assume a frequency of 100 GHz, but Tabled 
shows the required integration time and baseline for several 
different frequencies. The second and third columns show 
the beam diameter and the sensitivity at a fixed integration 
time of 60s. In the fourth and fifth columns, we calculated 
the baseline needed to have Acf) < 2" and the corresponding 
sensitivity at the same fixed integration time of 6Qs. In the 
last column we calculated the integration time Afint needed to 
reduce the rm.s. noise variance to 10/iK (using the scaling 

ao oc ^/B^Atini)- 

For the present study we will assume that the sensitivity 
of the detector is (Tq < 10/iK, and its angular resolution is 
A<j) « 2". As Table ^ shows, these assumptions are realis- 
tic for ALMA, requiring 23-170 hr integration for all fre- 
quencies below 350GHz. For even longer observation times, 
the detector noise can be neglected compared to the statisti- 
cal fluctuations of astrophysical origin (the latter, which has a 
magnitude of ^ 10/iK, will be discussed in §|5lbelow). 

^ Adopted from |http://www.ahna.nrao.edu/info/sensitivities| 
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3. SELF-SIMILAR CLUSTER DENSITY AND TEMPERATURE 
PROFILES 

In this section, we describe the models we adopt for the 
density and temperature profiles of gas in the cluster These 
models assume that clusters are described by spherically sym- 
metric dark matter halos. We first follow .Komatsu & SeliakI 
(|2001 ) to obtain self-similar gas density and temperature pro- 
files. These profiles are then truncated and normalized ac- 
cording to the assumed location and sharpness of a virial 
shock. 

Many high-resolution N-body simulations suggest that the 
dark matter density profile pdmir) is well described by a self- 
similar form: pdmir) = Psydmif/rs) where is the mass den- 
sity normalization factor, is a length scale, and ydmix) is a 
non-dimensional function representing the profile. The den- 
sity normalization, ps, is determined to yield mass Mvii when 
Pdmir) is integrated within the virial radius, rvh . The length 
scale is defined as r, = r^ijc, where c is the concentration 
parameter. We use a fitting formula from detailed gas dynam- 
ical simulations for the concentration parameter (see eq. ^| 
below), and calculate rvir(Mvir,z) from the top-hat collapse 
model (see discussion below). 

The dark-matter profile is approximated by the following 
analytic form 

^''-^"^ %^(l+x)3-^ ' 

whe re the parameter a is assumed to be eithe r a = 1 ora = 3/2 
(see iNavarro et al ] fT997l iMoore et al.llT999l and LTing & Sutd 

The gas density profile can be obtained using a polytropic 
model in hydrostatic equilibrium with the dark matter back- 
ground. In this case, the gas density and temperature profiles 
assume a self-similar form, 

Pgas(^/rs) = Pgas(0)ygas(?-/ ^s) (3) 

7gas('-/r,) = 7gas(0)y^-'(r/r,), (4) 

where Pg^s cx PgasTgas c< has been used. The hydrostatic 
equilibrium equation 

Pg3S 



1 dPgas ^_qM{< r) 



(5) 



can be solved JSuto et alJll998l) for y^^J^x) for a fixed dark 
matter mass distribution. 



, , , _i7-l c [ m{u) 
7 m(c)Jo 

where is an integration constant, x is defined as 

r 



(6) 



(7) 



and m(u) is the dimensionless mass within a distance u from 
the center 

m(x) w mdm(-x) = 47r / duu^ydmiu)- (8) 

JO 

These integrals can be evaluated analytically JSuto et alJ 
[l998) for the particular cases a=l and a = 3/2. 

The mass-temperature normalization is given by the virial 
theorem 

_i _ G^njpMvir 



V = 



(9) 



Both theoretical and numerical studies assert that the gas 
density profile traces the dark matter density profile in the 



outer regions of the halo. Therefore, the slopes of these 
two profiles are assumed to track each other closely for 
c/2 < X < 2c. This requirement fixes the polytropic index 
7, and the normahza tion (see equations 22, 23, and 25 in 
iKomatsu & Seliak 2001). 

The concentration parameters, c and c^fw can be written in 
terms of the virial mass and redshift for a given cosmologi- 
cal model. A fitting formula based on numerical simulations 
(Eke et al. 2001) is 



cnfw - 



6 / Mv: 



1+Z V 10l'*/j-'M(r 



-1/5 



(11) 



Equation (llOt supplies a one-to-one correspondence between 
c and Mvii . 

The self-similar model defined above does not assign a 
value for the gas density normalization pga,s(0). Its value can 
be calculated by requiring that the ratio of the total dark mat- 
ter and gas mass within some radius jc^ax attain the univer- 
sal average value of Simulations without feedback 
from galaxy formation typically find values for the cluster gas 
mass fraction that are only slightly lower than the input global 
baryon fraction (Evrard 1997). Using cosmological parame- 
ters cited above 



yOga.s(0) = P: 



^gas(-^max) ^dm 



(12) 



where mgas(x) is the accumulated dimensionless mass of the 
gas analogous to mdm(-^) defined in equation (|8|i. The param- 
eter /Ogas(O) has only a mild x^ax dependence for reasonably 
low Xtnax values. Using a fixed x^ax = 5c yields a maximum 
error of 10% for x^ax G [c, 10c]. 

We calculate the virial radius, rvh, using the spherical top 
hat collapse model, i.e. we take it to be the radius of a 
spherical region with mean interior overdensity relative 
the critic al (obtained from the fitting formula in equation 6 of 
iBrvan & Normanlll998h that encloses a total mass Mvii . The 
corresponding angular radius of the cluster is 



©vir — 



dA(z) 



(l+zf 
' ddz) 



(13) 



where we adopt a fitting formula (|Pen| |1999 l) for diiz). With 
this choice, the only free parameters needed to specify the full 
gas density profile, /Oga,s('"), are Myh-, a, and z- 

Let us now introduce a truncation of the density (and tem- 
perature) profile (|2j to account for the shock front near the 
virial radius of the cluster As pointed out above (see eq. ll2> . 
such a truncation was already needed for calculating the nor- 
malization of the gas density. Let us assume that the density 
has a linear cutoff between radii x^ax - D and Xmax + D. At 
Xm-dx + D the density attains the background value. Since this 
value is small compared to ydm(c) for realistic clusters, we set 
both the gas density and temperature at x > x^ix + D to zero. 
Since the gas density profile traces the dark matter profile, 
the linear density cutoff can be imposed directly on the gas 
density. In what follows, we will refer to the original density 
profile (eq. |6} as ygasaix). We will denote the density profile 
with a cutoff by 3'gas(-^), defined by 



3'gas(-^) = yg.i^.(x)W(x) 



(14) 
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where 

W(x) = I [(x„ 




if X <x„ 



-D 



, + D)-x\/{2D) if Xmax + £><X< 
if Xtnax + £> 



-D 



(15) 

For consistency, the Pga,s(0) gas density normaHzation has 
to be recalculated from equation \\2\ using the mass m{x) en- 
closed by the density profiles truncated with a given choice of 
-^max and D (see eq.|8}. In practice, Pgas(O) is sensitive to x^ax 
only, with the D dependence nearly negligible. Thus, for cal- 
culating the normalization /3c;as(0), we always assume D = in 
equation (I12> . 

4. SZ SURFACE BRIGHTNESS PROFILES 

The two-dimensional SZE intensity profile I{x), as a func- 
tion of projected radial distance x away from the cluster cen- 
ter, is often expressed as a small temperature change of the 
CMB spectrum. The SZE corresponding temperature incre- 
ment Ar(x) (which can be negative) is defined by 



I{x) = i\T(x). 



(16) 



dTcMB 

Solvi ng for the SZE temperature yields dCarlstrom et alJ 
l20()2i) 

/^T{x) = myc{x)TcMB, (17) 

where 6 = hv/kBTcuB is the dimensionless frequency, yc is 
the Compton parameter, and f{9) is given by 



m= e 



■1 



4 (i+w^,?;)). 



(18) 



The 5sze{x,T^) term is the relativistic correction to the fre- 
quency dependence, which is negligible at 100 GHz, but be- 
comes important at higher frequencies (> 250 GHz). The 
Compton yc-p^"ameter is defined as 



(19) 



where gt is the Thompson cross-section, n^, is the electron 
number density, is the electron temperature, kg is the Boltz- 
mann constant, mgC^ is the electron rest mass energy, and the 
integration is along the line of sight (i.e. along r = Vx^ + l^ 
for a given x). We calculate the electron temperature, r^, with 
Te = Tgas using equation (|9}, and the number density is given 
by 

Pgas(0)3'gas(-X) 



Heix) = ■ 



(20) 



where nip is the proton rest mass, and /i = 0.59 for an ionized 
H-He plasma with 25% Helium abundance by mass. 

Substituting 0, (|9}, and in (IT9jl the SZ surface bright- 
ness profile in\ is separable into a dimensionless integral 
with spatial dependence and a constant coefficient 



AT(x) = AT,Y(x), 



(21) 



where 



and 



at; = mTcMB<JT -\ ^^Ps.s(0)v (22) 



Y{x) = 2 d/[3;gas(Vx2 + /2)]T. (23) 
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1. — Predicted SZ surface brightness profiles for A/yir = 10 Mq, 
a= 1.5, and u = lOOHz. The solid curve shows the profile without a 



Fig. 
z = 0.1 

cutoff (the calculation used .Vmax = 200 and D = in practice), and t he do tted 
line corresponds to the SZ profile with .Vmax = c and D = 0. Equation I21i was 
evaluated with Ar(0) = O.VmK. 




Fig. 2. — The difference between the normalized brightness profiles 
with and without a cutoff, for various D values. The difference is taken be- 
tween the ^T(x) profiles with different .tmax and D parameters, but identi- 
cal (Mv„-,a,z) = (10"Mq, 1.5,0.1), and Ar, values. The SZ profiles with 
^raax = 2c and Z) = 0, 0.2c, and 0.4c were compared with the same profile 
without a cutoff, for which jTmax = 200 and D = was used in practice. The 
fact that Ar — Ar"'' is nonzero at jc = is a consequence of the jCmax depen- 
dence of Y(x). 



Figure[2depicts the Ar(x) profile for a = 1 .5 with and with- 
out a cutoff. Without the cutoff, the ygas(.^) and Ar(x) profile 
has a nonzero limit for large x. Various choices of x,nax can be 
compared by analyzing the difference between the associated 
intensity profiles. Figure|2]shows the difference between pairs 
of profiles with jc^ax = 2c and without a cutoff, using either 
D = 0, 0.2c, or 0.4c for the truncated profile. All other pa- 
rameters, such as Mvir, a, z, and AT^ were taken to be equal. 
Note that the AJix) profile is obtained from Y{x) by mul- 
tiplying by the ATj coefficient. According to equation Mil . 
various x^ax choices generally lead to unequal AT, values, im- 
plying a large nonzero difference between the AT{x) profiles 
at X = 0. In principle, this difference is physical, reflecting 
the normalization criterion we chose (namely that the clusters 
contain the average baryon fraction within the virial shock - 
no mechanism is known to segregate baryons from dark mat- 
ter on these large scales). Since the normalization is model- 
dependent, one might worry that it will effect our analysis 
below, where we study the detectability of the virial shocks. 
However, we find that the information from the normalization 
does not dominate our results (see discussion below). 

Table|2]shows the central Ar(0) values for several different 
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TABLE 2 

Central SZE decrement temperatures 





a 


100 GHz 


150 GHz 


200 GHz 


250GHz 


10" 
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-0.019 


-0.012 


-0.003 


0.006 


10" 


1.5 


-0.029 


-0.018 


-0.005 


0.010 


10'^ 


1 


-0.106 


-0.067 


-0.018 


0.036 


1014 


1.5 


-0.173 


-0.110 


-0.030 


0.059 


10" 


1 


-0.609 


-0.385 


-0.106 


0.207 


10" 


1.5 


-1.017 


-0.643 


-0.178 


0.345 


Note. — The SZE decrement temperatures are given in mK. 
We assumed x^nx = 2c, D = 0.01c and z = 0.3. Ar(0) varies by 
less than 20% for the range c < Aimax < 10c. 



cluster masses and density profile slopes, and for various fre- 
quencies. We find that the a = 1.5 dark matter profile yields 
cx 1 .6 X larger Ar(0) values than the a=l model. We can elu- 
cidate the source of this increment by tracking the differences 
between the two models in equations i21\ and (I22> while fix- 
ing the other parameters at Mvii- = IQ^^Mq, z = 0.3, x^ax = 2c, 
and D = 0.01. The difference is caused by the change in the 
product /3gas(0)77F(0)/c. First, since the dark matter model 
with a = 1 .5 is concentrated more in the central region, the gas 
density and the gas temperature is expected to be higher in the 
center for a= 1 .5. Indeed, /9gas(0) = 4.5 x 10'^ psc for a = 1 .5, 
while it is 1.0 x lO^p^c for a= I. Second, the 77 parameter is 
simply proportional to the central gas temperature rgas(O) (|9}, 
which is somewhat higher (by 21%) for a= 1.5. Third, the 
1/c factor further increases the difference by 70%. Finally, 
the increment caused by variables localized to the center is 
smeared by Y, which accounts for the fact that the observa- 
tion measures the projection of the intensity along the line of 
sight. In particular, 7(0) = 0.13 for a = 1.5 and 7(0) = 0.71 
for a= I. Therefore, the resulting increase in the central SZE 
temperature is 60% for a= 1 .5. 

5. NOISE ESTIMATES FOR FUTURE SZ BRIGHTNESS MAPS 

We shall now summarize the primary observational diffi- 
culties for detecting virial shocks.'* The possible sources of 
contamination are detector noise, atmospheric fluctuations, 
and other uncertainties of astrophysical origin. Inevitable as- 
trophysical contaminants include radio point sources, bright 
foreground (and background) clusters along the line of sight, 
and the primary and secondary CMB anisotropics. 

As pointed out in §|2] above, the atmospheric and detector 
uncertainties will not be crucial at moderate frequencies, as 
the ALMA system can reduce its noise level below IQfiK for 
a 2" beam at lOOGHz in an integration time of ^ 20hr. 

Similarly, bright extragalactic foregrounds will not pose a 
great barrier against the observation of virial shocks. The ra- 
dio point sources cover only a negligible portion of the set of 
relevant pixels, and their removal reduces the efficiency of a 
high resolution detection by a negligible amount. The flux of 
the brightest clusters identified in separate wide-field X-ray 
and SZ surveys can also be subtracted from the image. More- 
over, uncertainties from the imprecision of this information 
can be minimized by choosing galaxy clusters with no obvi- 
ous overlapping foreground clusters. 

4 This discussion ignores obvious additional theoretical modelling diffi- 
culties, such as departures from the spherical cluster models adopted here. 
These will be discussed in ii l6.2l and iilslbelow. 



The magnitude of uncertainties from primary and sec- 
ondary CMB anisotropics can be assessed by the analysis of 
the corresponding angular power spectra. The primary CMB 
anisotropy supplies the dominant power on large-scales, but 
is exponentially suppressed well below 10/iK for scales un- 
der ^ 10'. The angular radius of the clusters decreases be- 
low this scale at z > 0.19 for Mvi, = IO'^Mq and z > 0.08 
for 1O'"*M0 (eq. I13> . We shall restrict our calculations 
to larger redshifts, where secondary CMB anisotropics will 
likely dominate. The two largest sources of the secondary 
CMB anisotropics are the fluctuations from unresolved back- 
ground clusters (i.e. overlapping faint clusters of varying tem- 
peratures, sizes, and numbers along the line of sight) and 
the Ostriker-Vishniac (OV) effect. The OV contribution can 
be approximated by a flat band power contribution of IfiK 
JHu & Dodelson 2002). According to both theoretical and 
numerical s tudies ( Holder & Carlstrom 2001; Holder 2004ji 
lKomatsu"&^SeliaL.2002;.Springel et al..2001.) . the rms power 
of the unresolved thermal SZE contamination is between 1 
and 6^K on arcminute scales. For low-mass clusters this 
noise background is strongly non-Gaussian, and there is a sig- 
nificant amount of uncertainty in the small-scale power spec- 
trum, due to non-gravitational effects such as gas cooling and 
feedback from star formation. For a conservative estimate, we 
shall assume that the fluctuations around the SZ profile are 
given by white Gaussian noise with amplitude = 10/xK. 

6. SIGNIFICANCE OF DETECTING VIRIAL SHOCKS 

In this section, we consider the detectability of virial shocks 
in galaxy clusters. The basic input to this analysis is a (mock) 
SZ surface brightness map of the cluster We compute the SZ 
profile AT^%x) according to equation (12 1> in a mock "fidu- 
cial" model, in which the cluster is assumed to be spherically 
symmetric with a well-defined edge, at the position of the 
virial shock. This fiducial model is uniquely described by 
five parameters, specifying the global properties of the cluster 
(Myii-,z,a) and the location and sharpness of the virial shock 
(jcfjjaxi^)- For concreteness, in the numerical calculations, we 
use a single, fixed measurement frequency of 100 GHz. This 
value was chosen to minimize the integration time needed for 
ALMA, while keeping the total SZ decrement around its max- 
imum in the nonrelativistic regime. 

Our basic task is to evaluate the significance, given the ^ 
IQfiK noise, at which AT^^{x) can be distinguished from a 
different model profile Ar"^^'(x), computed in a test model 
that does not include a shock. This task immediately raises 
the question: what should the no-edge benchmark model be 
for this comparison? 

The simplest approach would be to choose the underlying 
density profile to be identical to that in the fiducial model 
within X < xjjfax- In the absence of a virial shock, however, 
the smooth density profile would extend to some large ra- 
dius. This could be expressed by choosing the same param- 
eters (Mvir,z,a,Z)) as in the fiducial model, but replacing the 
location of the virial shock by jcj^ax ^ However, a compli- 
cation is that our procedure to self-consistently normalize the 
density profile (eall2> would then fail, due to the divergence 
of the mass integral (eq. |8}. To resolve this ambiguity, we 
shall adopt either of the following two approaches to define a 
smooth, no-edge "test" model (hereafter referred to as Model 
I and Model II, respectively): 

1. The central gas density /Cigas(O) is chosen to equal the 
central density in the fiducial model. Equivalently, the 
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central decrements A7i are assumed to be the same for 
the two hypotheses. The value of xj^l'^ is then irrelevant, 
as long as it is chosen to be sufficiently large, x^^l'^ ^ c 
(in practice, we find that x'^^^ > 10c suffices). 



2. The location of the virial shock jcjjjl^ is chosen to equal 
the radius at which the cluster gas density equals the 
background baryon density. This value is between 
4.5 < JCmax/'^ — 6 for typical clusters with virial mass 
lO'^M© < Mvii < 1O'^M0, at any redshift and a (either 
1 or 1.5). The gas density normalization is calculated 
self-consistently, using equation il2i . The profile out- 
side x^jI'^ is truncated with D = 0. Note that the temper- 
ature decrement scale ATs and the central temperature 
decrement Ar(0) of this model are both different from 
the fiducial model (by - 10% for Ar(0) for x^^^ = c). 

Model II has the advantage of a closed and self-consistent 
theoretical description, where Pgas(O) and thus AT, is de- 
rived directly from a set of underlying assumptions. How- 
ever, we note that the SZ brightness profile in this model dif- 
fers from the fiducial model because of the "renormalized" 
value of the temperature decrement AT,. Hence, the dis- 
tinguishability of test model II from the fiducial model will 
have an additional indirect dependence on the presence of 
the virial shock, through the effect of the virial shock on the 
overall normalization of the density profile. This inference 
will then be model-dependent (i.e. on our particular assump- 
tion about the relationship between the location of the virial 
shock and the normalization of the gas density), and also sub- 
_^ct to uncertainties due to physical effects (i.e. preheating; 
[Holder & Carlstrom 2001) that can significantly modify the 
inner density profile. Nevertheless, we find in our calculations 
below that the change in the temperature due to its renormal- 
ization is relatively small, and changes in the SZ brightness 
profile are only significant in the inner regions. Overall, the 
temperature change adds nearly negligibly to the detectability 
of the virial shock. Hence, in practice, model II is a useful 
alternative to model I. The latter, by construction, isolates the 
effect on the surface brightness profile of a sharp discontinuity 
itself. 

The expected SZ profile of a cluster with or without a cutoff 
can be calculated from equation (12 1> in the fiducial model, 
and compared to the profile in either test model. In the next 
two subsections, we quantify the significance for detecting the 
difference between these pairs of models. 

In these estimates, we will assume that the interferometer 
has fully synthesized an aperture, yielding a narrow (~ 2") 
effective point response function (PRF). As discussed above, 
this assumption is realistic for ^ 20hr interferometric observa- 
tions with ALMA. Thus, we will employ the simple, single- 
dish technique of estimating S/N, in which the image of the 
extended source can be thought of as being composed of K 
independent pixels (where each pixel covers a solid angle of 
~ 12 square arcseconds). 

Finally, as stated above, in the simplest estimates, we fix 
the parameters (Mvir,z,a, AT^) in Model I, and the parame- 
ters {Myir,z,oi) in Model II, to be the same as in the fiducial 
model. While Mvii and z can be independently estimated from 
gravitational lensing. X-ray, and optical data, the decrement 
normalization AT^ of the fiducial model will not be known a- 
priori. Therefore, to obtain a more realis tic e stimate for the 
detectability of the virial shock, in section l6^ below. we will 
relax these assumptions. More specifically, we will obtain fits 



to the mock SZ profile of the fiducial model, by allowing Ar^ 
(as well as the concentration parameter c; see further discus- 
sion below) to be free parameters in test model I. 

6.1. Significance of Shock Detection in Fixed Test Model 

We shall first derive a simple estimate for the significance 
for choosing between profiles with and without a cutoff, using 
the comparison between test model I and the fiducial model. 
Since all of the parameters of the test model are fixed, this ex- 
ercise will yield an estimate for the significance of the change 
in the SZ surface brightness maps caused by the presence of 
a shock front. The presence of the shock, in general, is more 
difficult to infer when the test model's parameters are not as- 
sumed to be known a priori. This question will be taken up in 
§IObelow. 

The cutoff radii in the fiducial and test models are taken 
here to be x^^^ = c and x^^^^ = 10c, respectively (although our 
results below do not depend on the latter choice). The ef- 
fective "signal to noise ratio" is calculated by comparing the 
difference in the "signal" 5AT{x) = AT'^^x)- AT^'^{x) to the 
noise power in the regular two-dimensional angular space. 

The effective signal to noise ratio corresponding to a single 
pixel at the projected radius x is 



S \ ^ max,( Ar"='^'(-«) - T^\x)f 



( AT'^\Q) \ " max,.(}'"=^'(x) - Y^\x)f 



\ (Tn J Ffi'i(0)2 

/ Ar(0) \ ' (Y'''\xf^J - Y^Hx^^,,)? 



\ (Tn J 



7(0)2 



(24) 
(25) 
(26) 



In the last two lines, we have dropped the superscript from 
Ar(0) and Y(Q) because these have the same values in the 
fiducial and test models (although Ar(0) and F(0) do depend 
on Xmax, the difference between the fiducial and test models 
is negligibly small). We have also adopted D^'^ = for the 
fiducial model, in which case the maximum of SAT(x) occurs 
at x^^^ (see Fig.|2}. For a massive cluster with M^h- = IO'^Mq, 
a = 1 .5 and z = 0.3, we find Ar(0) = 1 .06mK, and 



(Y (^max) Y (XjYiax)) 

Ar(0)V4= 1-13 X 10'' 



= 6.0x10"'* (27) 
(28) 



implying that S/N w 2.6 for the single best pixel on the clus- 
ter surface, i.e. at the pixel corresponding to the edge of the 
cluster at the virial shock. These values increase somewhat 
with increasing z. 

At an effective angular resolution of Acj), the total number 
of resolved pixels covered by the cluster is given by the (solid 
angle extended by the circular disk within the shock radius) 
divided by the (solid angle extended by a single pixel): 



Ktc 



max vir 



A02 



(29) 



Note that for this estimate, we consider the cluster to extend 
out to x^^^; this will result in an underestimate, since the dif- 
ference between the fiducial and test SZ profiles extends out 
to larger radii (see Figure Since = 8.1' for z = 0.3 for 
a massive cluster, we expect Ktoi = 5.9 x lO'* pixels for the 
angular resolution A<j) = 2" of ALMA. Of these Kiot pixels, 
Kedge ^ (2ji;max0vir)/(cA0) He along the 1-pixel wide circular 
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annulus along the virial shock. A quick estimate for the to- 
tal S/N distinguishing the two models is given by adding the 
maximal single-pixel S/N in quadrature, i.e. 




For a massive cluster Myir = lO'^M© at z = 0.3, we find 
^edge ~ 490, and the detection of the virial shock at Xmax = c 
with ALMA has a signal to noise ratio S/N ~ 57. Since 
this value is from a single pixel-wide ring around the virial 
shock, it is strictly only a lower limit. Figure |2] shows that 
there is an annulus of finite AjCedge width around the virial 
shock with a significant 5/S.T . Axedge can be estimated with 
the radial distance where 5/S.T drops to half of its maxi- 
mum value. A simple formula found from approximating 
Ar(x) to second order in x is AjCedge = -^^mlx/^-, leading to 
■'^edge ^ (■Xm'ix©vir)^/(2c^A0^). For a massivc cluster, we ob- 
tain /Tedge = 3.0 X lO'* and = 440. We find litde depen- 
dence of this estimate on a for choices of a = 1 vs. a = 1.5. 
On the other hand, we find a redshift dependence: larger z de- 
creases 0vir but increases 5/SJ{x^^^. The combined result is 
a minimum in the total S/N at z = 0.28. Lower-mass clusters 
have a smaller angular size and a lower SZ decrement, so S/N 
decreases with Myir. Overall, the virial shock is most visible 
for clusters that are either close-by (z ^ 0), or at high redshift 
(z ^ 0.28), and that have a large Mvii and a. 

To improve the simple estimates above, we can average the 
S/N over the face of the cluster. This leads to the following 
approximation for the average single-pixel S/N: 

(31) 

In Appendix A, we give a rigorous derivation of the S/N for 
distinguishing between models in the presence of a Gaussian 
noise. We find that equation ( 13 U . with the average taken in 
the interval Q <x < x'^^^, multiplied by the square-root of the 
number of pixels, y/K^t (eq. I29> . indeed gives the formally 
correct answer. Accordingly, the significance of distinguish- 
ing test model I or model II from the fiducial model is given 
by 

S _ VK j A7:,y/d2x[ytest(_,)_yfid(^)]2 (I) 

VAaN I ^ J d2x[Ar;<=^'y"=»'w- Ar/dFfid(j^)]2 (n) 

(32) 

Here K = (x'^^j^Qyir)^ /{cA(p)^ is the number of pixels, and A = 

2 

""■^max is the area covering the cluster's surface. Note that the 
only difference between Models I and II is that in the latter 
case, the temperature scale AT,"^^' is recalculated using the 
cutoff location x'^l'^. The right-hand-side of equation i32\ 
can be compared with Figure|2] which plots the integrand for 
Model I. 

We evaluated the significance given by equation (I32> for 
various choices of fiducial models. Recall that the fidu- 
cial model is specified by five parameters {Myii,z,a,Xja.^^,D). 
The results are listed in Table |3] for three different masses 
(Mvir = 10'^ 10'"^, IO'^Mq) and two different inner slopes 
(a = 1 , 1 .5). The other three parameters in this Table are fixed 
at the values of {z,Xma^,D) = (0.3, c, 0.01c). Figure |3] shows 
how the S/N varies with z and Xn,ax for these masses and 
slopes (we fixed D = 0.01c in this Figure). 



TABLE 3 

Typical S/N ratios for detecting the virial shock 





a 


e 


c 




|A7-r'| 


„tcst 
max 


S/N 


S/N 


[Mq] 




['] 




[mK] 


[mK] 


[C] 


I 


II 


10" 


1 


1.8 


7.9 


0.02 


0.02 


4.6 


0.6 


0.7 


10" 


1.5 


1.8 


4.6 


0.2 


0.2 


4.7 


0.6 


0.8 


lO'" 


1 


3.8 


5 


0.1 


0.1 


5.2 


18 


17 


lO"* 


1.5 


3.8 


2.9 


1.3 


1.1 


5.4 


19 


19 


1015 


1 


8.1 


3.1 


0.9 


0.8 


5.8 


550 


440 


10'5 


1.5 


8.1 


1.8 


8.3 


6.9 


6 


570 


480 



Note. — In all cases, we assume z = 0.3, jcjjj?,^ = D = 0.01c. 
The self-consistently derived .vjjj^'j; value is used for Model II, whereas a 
fixed = 10c was assumed for Model I. 



TABLE 4 

Typical S/N ratios for detecting the virial shock 



Mvir 


a 


e 
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4.6 
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0.4 
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3.8 


5 


0.1 


0.1 


5.1 


8.9 


10 


10'^ 


1.5 


3.8 


2.9 


1.2 


1.1 


5.3 


9.6 


12 


1015 
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8.1 


3.1 


0.9 


0.8 


5.7 


280 


240 


10'5 


1.5 


8.1 


1.8 


7.9 


7.0 


5.9 


300 


280 


Note. 




Same 


as Table I3I except 


we assume 


fid 
max 


= 2c and D^'' = 



0.5c. 



In order to study the sensitivity of the S/N to the choice 
of the fiducial parameters of the virial shock itself, in Ta- 
ble0]we repeat each calculation in Table|3] except we replace 
(xf^^^,D^'^) = (c,0.01c) by (x|jf^^,D«'*) = (2c,0.5c). The latter 
choice should also be regarded as more realistic, given that 
the strong virial shocks are found in simulations to be located 
at ~ twice the virial radius (e.g. Figure 16 in Bryan & Nor- 
man 1998), and may not be perfectly sharp. As Table|4]shows, 
the larger value of xf^^^ decreases the S/N by about a factor of 
two (see Figure|3}- 

The main conclusion that can be drawn from Table |3] and 
Figure |3] is that the S/N is a strong function of the size 
of the cluster: the virial shock of a massive cluster with 
Mvii- ^ IO'^Mq is detectable at high significance (S/N ^ 
500), but the detection becomes marginal for low-mass clus- 
ters or groups with Myir ^ lO'^^Mo, and clearly impossible for 
even smaller objects. 

We also find that the S/N is nearly identical for different 
choices of a and D. However, Figure |3] shows that S/N de- 
creases with Xmax, and it is a non-mono tonic function of red- 
shift with a minimum at z ~ 0.3. Two of our findings are 
somewhat counterintuitive: cluster edges are equally visible 
for smeared edges, and are more visible for more distant clus- 
ters, even though these have a smaller angular size. The first 
of these results can be explained by recalling the definition of 
the cutoff. Increasing D does not change the total cluster SZ 
intensity. If the cutoff shape is known exactly prior to the ob- 
servation, as it is assumed in this section, increasing D (while 
fixing Ar,) does not come closer on average to a profile with- 
out a cutoff. 

To explain the second pecuUarity, the behavior of S/N as a 
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Fig. 3. — The signal to noise ratio for various cluster masses as a function 
of ^max (upper panel) and z (lower panel). Each curve has only one parameter 
changing, the other parameters are fixed at the values listed in Tablel3l 



function of z, we considered the z-dependence of each indi- 
vidual factor that determines the S/N. The most puzzling fea- 
ture, i.e. the increase of S/N towards high redshifts at z > 0.3, 
is attributable to an increase in the temperature scale and in the 
central temperature decrement Ar(0) (shown in Figure |4}. 
Although the angular radius <d decreases somewhat towards 
high-z, the temperature increase dominates. As a result, more 
distant clusters at z > 0.3 have a higher contrast, which makes 
the virial shock more visible even though the clusters have a 
smaller angular size.^ At low-redshift (z < 0.3), the situation 
is reversed, and the strong increase towards z = in the angu- 
lar size is the dominant effect, making the virial shock of very 
nearby clusters more detectable. 

Table |3]also compares the S/N results for Model I and II. 
The two models lead to approximately equal S/N detection 
ratios. This is somewhat surprising, since the renormalization 
in the temperature scale AT, and the difference in the pro- 
file shape Y{x) in Model II both introduce additional differ- 
ences from the surface brightness Ar(x) in the fiducial model. 
We find that these additional differences can be significant, as 
high as 2.5(7 for each pixel near the central region x ~ 0. How- 
ever, the difference is only at the level of ~ 0. 15cr per pixel at 
the radius x ^ x,^^^ that dominates the cumulative S/N. 

In conclusion, the estimates in this section indicate that the 
temperature decrement difference across a virial shock front 
causes a difference of approximately /S.T ^ 10/iK in the SZ 
surface brightness maps. This is a small difference, approx- 
imately at the level of the expected noise. Hence, in sin- 

^ For still larger redshifts, when then cluster's angular size drops below the 
angular resolution of the instrument, the S/N will have a cutoff. However, at 
the A</< = 2" resolution of ALMA, this critical redshift is well beyond the 
epoch when clusters form in a sensible cosmology. 
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^^^^^^^ ^ - 
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Fig. 4. — The central SZ decrement Ar(0) for various cluster masses 
(Mvii ) and power-law slopes (a) of their inner density profiles, as a function 
of redshift. Each curve has only one parameter changing, the other fixed 
parameters correspond to those listed in Tablel3l 



gle neighboring pixels, the shock fronts will not stand out 
to be detectable. However, shock fronts are coherent struc- 
tures extending over many pixels; in the simple spherical 
models considered here, the cumulative S/N ratio is suffi- 
ciently high to infer the presence of the shock front with a 
high-resolution instrument, such as ALMA. The presence of 
shocks in medium-sized to massive clusters should be clearly 
inferable, as long as the shocks are located near the virial ra- 
dius. If the density cutoff is farther in the outer region, the 
virial shock will only be observed for relatively massive clus- 
ters, with M > few x 10'"^Mvir. The detection significance is 
nearly independent of a and D. The virial shocks of clusters 
at z ^ 0.3 are least detectable, with nearly an order of mag- 
nitude increase in the S/N for z ~ or z ~ 3 compared to 
the minimum value at z = 0.3 Assuming that the noise is un- 
corrected and Gaussian for different frequencies, measuring 
at many different frequencies the signal to noise ratio can be 
increased by the square-root of the number of available inde- 
pendent frequency channels. 

6.2. Significance of Shock Detection in Variable Test Model 

The main deficiency of the signal-to-noise ratio analysis in 
the previous subsection is that at least some of the parameters 
of the test model will not be known a priori. This, in general, 
will make the cluster's virial shock more difficult to detect, 
since one may be able to adjust the variable parameters of the 
no-edge test model to better mimic the fiducial model that 
includes the shock. The mass Mvir and redshift z of the clus- 
ter can be independently estimated (e.g. from gravitational 
lensing. X-ray, and optical data), and the inner density pro- 
file slope a has a relatively small impact on our conclusions. 
Hence, we still fix the values of these parameters in this sec- 
tion. Although the SZ temperature decrement scale AT,"''*' 
can, in principle, be derived in the fiducial model, it will not 
be known a-priori. We therefore allow it to be a free param- 
eter In addition, the concentration parameter of the cluster 
c, which, in principle, can be ultimately derived from simu- 
lations, is still somewhat uncertain. Therefore, we allow c"''*' 
to be the second of our two free parameters of the test model 
in this section. The maximum radius used for the comparison 
of the SZ profiles including and lacking a cutoff is xj^^'^, the 
value used in Model II previously. Recall that this is the radius 
at which the baryon fraction equals the global value ilb/f^dm- 

A different complication is that our analysis relies on mock 
data, rather than actual data. Therefore, in the presence of 
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noise, our "data" should not be a fixed set of numbers (i.e. the 
profile AT(x) in the fiducial model). Rather, it must be a (set 
of) probabilistic variables that have a distribution and finite 
width. 

A rigorous signal-to-noise analysis that incorporates these 
two complications is presented in Appendix B. In this case, 
the S/N of distinguishing the test model from the fiducial 
model is itself a random variable (rather than a fixed number, 
as we have hitherto assumed). However, in Appendix B we 
show that the expectation value of S/N obeys the following 
bounds; 

5^^ A^^5 4^"-'' ^''^ 

c2 . 



where Sq is defined by 



min / d'-x[AT"''\x,i/^'')-AT^'^ix,p'"')]\ (34) 



The term p"^**' refers to the two variable parameters of the test 
model (i.e. AT^'"^^ and c''^^')^ and p^^ refers to the fiducial val- 
ues AT,^^, c^^, x^.,^, and D^^. This distinction is necessary 
since the corresponding parameters need not be equal. For a 
conservative estimate we shall use the lower bound in equa- 
tion (I33> . The expected value of the signal to noise ratio is 
therefore estimated by 



^exp 



AK 

- mm \ 

5 A UN P'° 



d2x[AP'=»t(jC,P"='')-Arfid(;C,pfid)]2 



(35) 

This should be compared with equation J32> describing the 
signal to noise ratio for fixed parameter values. The differ- 
ences are the overall ^^4/5 factor and taking the minimum 
over the p"^^' parameter space. We shall denote the best fitting 
test parameters, i.e. at which the integral is minimal, by /9q^'. 

Equation ( I35> measures the probability that the virial shock 
of the cluster is observable even in the face of two additionally 
variable parameters. 

The numerical results presented in the last section in Ta- 
ble |3] and Figure |3l considered 79"^'*' parameter values that 
were equal to the corresponding p^^ parameter values. Here 
we consider the fiducial model with p^"^ = (z, ajXroaxji*) = 
(0.3, 1.5,2c,0.5c) and Mvi, = IO^'M© or IO'^Mq. The fidu- 



cial values (Ar/'^,c'''*) are calculated from equations ( I10> and 



M2\ as before. The parameters p"^'*' are assumed to be the 
same for (Mvir,z,a) as in the fiducial model, but, in contrast 
with the previous section, (Ar^^'^^'jc"'^') are allowed to vary 
relative to the fiducial values. 

The fractional deviation of the signal to noise ratio as com- 
pared to the original value of test model I for various p^'^^ 
parameters is plotted in Figure |5] The \/A-/5 factor decrease 
because of the increase in the average noise power is not in- 
cluded in the contour levels for clarity. Thus, the contours 
have the value of 1.0 at the fiducial parameters. The best fit- 
ting parameters /?q ^' correspond to the values at the minimum 
of S/N. 

The most important result from the analysis displayed in 
Figure |5l is that the signal to noise does not decrease signif- 
icantly, even in the best-fitting model, relative to the values 
listed in the previous section (Table |3l and Figure |3}- By ob- 
taining the best fit for various x^ax we find that the maxi- 
mum decrease is a factor of 0.95 x ^4/5,0.9 x ^4/5, and 
0.5 X ^475 for Mvh- = 10'^Mq,10'4Mo, and IO'^Mq re- 
spectively. The virial shock detection probabilities therefore 




Fig. 5. 



The 



") parameter dependence of the S/N 



ratio for detecting the virial shock of a tiducial cluster with parameters 
pfl'' = (z.a.-Vnux.O) = (0.3, 1.5, 2c, 0.5c) and M^i, = 10"*Mq (lower panel) 
or 10"Mq (upper panel). The fractional change in the S/N is plotted 
relative to the original S/N in Model II. The contours increase linearly in 
steps of 0.1 for SS/N > I, and in steps of 0.05 for 5S/N < I. The test 
model does not have a virial shock, and its best-fitting parameters are bi- 
ased. The minima are marked with an X in the upper and lower panels. 
These values are 0.85 for (AT^'''''/AT^'^,c'^''yc^'^) = (1.38,0.38) and 0.94 
for (AT^'=''^/AT^'^,c'^'"/c^'^) = (0.91, 1.14), respectively. The relatively mi- 
nor decrease in the S/N indicates that variations in c and AT, cannot mimic 
the presence of a virial shock, i.e. that the shock remains a distinctive feature. 



remain significant for Mvh = 10 M©, but become marginal 
(S/N < 10) for low-mass clusters (Myt < lO'^^M©). 

Another interesting feature shown in Figure [S] is that the 
best fitting parameters can deviate significantly from the fidu- 
cial values. Changing Myir, JCmax and D shows that the S/N of 
particular clusters can have saddle points and multiple local 
minima in terms of the p^''^^ parameters. We find that this is 
the case, for example, for Mvh- = lO'^^M© and 1 < x^^^/c < 
1.5, where the local minimum near the fiducial values is not 
global, and the global minimum (AT^q'^'jCq'*') instead lies at 
a much larger SZ temperature and a much smaller concen- 
tration. This behavior follows from the fact that decreasing c, 
while fixing all other parameters, produces a steeper Ar"^'*^-^) 
profile on log-log scale in the outer regions, a better approxi- 
mation of the fiducial model with a virial shock. On the other 
hand, the decrease in c also slightly decreases the SZ tem- 
perature in the inner regions, which can be compensated by 
increasing AT,. Overall, a test model profile with a smaller T, 
and larger c comes closer to the fiducial model. 

In Appendix C, we present a simple method to obtain ana- 
lytical estimates for the best-fitting ^' parameters of a false 
test model. This approach is fully general, and gives a nu- 
merically efficient way to compare two arbitrary models with 
arbitrary numbers of parameters. Our method is somewhat 
analogous to a Fisher matrix approach. The standard Fisher 
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matrix treatment serves to estimate the expected uncertainty 
of the parameters of a correct model. In contrast, the method 
presented in Appendix C gives an estimate of the best-fitting 
parameters in an incorrect model (and does not address the 
uncertainty of these best-fitting values). 

7. PARAMETER ESTIMATION 

In the previous section, we have examined the signal to 
noise ratio for distinguishing test models from the fiducial 
model. The primary aim was to make a distinction between 
the two basic possibilities, i.e. whether the cluster does or 
does not have an edge. A more ambitious problem is to ex- 
amine the precision with which the location and sharpness of 
the edge can be measured, i.e. the precision of determining 
the parameters that yield the profile Ar*'''(x). We calculate 
the uncertainty of the parameter estimation using both a non- 
linear likelihood function directly, and with an approximate 
Fisher matrix method. 

We adopt the fiducial model of the previous section to pre- 
dict the SZ temperature decrement profile. As before, the 
fiducial model includes an edge, and is specified by the five 
parameters Mvii, z, a, JCmax, and D whose fiducial values will 
here be collectively denoted by p^'^. We then compare the 
temperature decrement in the fiducial model, AT{x,p^'^), to 
two different sets of test models. The test models in this case 
are identical to the fiducial model (i.e., they include an edge), 
except we allow a subset of the parameters to have values 
^test y ^fid xjjese two cases are analogous to the test mod- 
els I and II in the previous sections. In the first case (Model 
I), we consider a set of four unknowns ATj, c, x^ax, and D. 
In the second case (Model II), we consider only two free vari- 
ables jCmax and D, and fix Ar^ and c at the values calculated 
in the fiducial model using equations i22\ and (I10> . The other 
parameters describing the cluster, Mvir, z, and a, are held fixed 
in both cases. 

Now let us suppose that a temperature decrement AT°^'^(x) 
is measured, and with no prior information, a parameter p"^'*' is 
to be chosen that best describes the data. Due to random noise, 
the measurement yields a parameter estimator with some un- 
certainty. The parameter estimator and its uncertainty can be 
obtained with the maximum likelihood test, following Ap- 
pendix A. The likelihood function is 



L(AT°^\p) = PiAT"^" I AT{p)) = 
K 



1 



exp 



d^x{AT°'°Hx)- AT(x,p)f 



(36) 



The constant normalization pre-factor is irrelevant for the 
likelihood ratio test, which we hereafter omit from the like- 
lihood function. The log likelihood is 

\nUAT°''\p) = --^ f d^x[AT'"'%x)-AT{x,p)f. 

(37) 

where K and A are the total number of pixels and the surface 
area of the cluster The parameters are then chosen to maxi- 
mize L(AT°^'^,p), or equivalently by applying a least squares 
fit to the data. 

The signal to noise ratio is obtained by the analysis of 
the projection of the SZ observation profile on the subspace 
spanned by the fiducial model profiles with arbitrary parame- 
ters. The noise power for A^'''' variable parameters is the vari- 
ance on this subspace, A^^ = N^'^crj^, and the signal to noise 
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Fig. 6. — The (.v[J;fx.^'°*') parameter dependence x^-contours for Model II 
with (z.o.Xmaxif) = (0.3, 1.5,2c,0.5c), using Mvir = IO'^'M© (lower panel) 
and Mvii- = lO'^M© (upper panel). The contours increase Unearly in steps of 
0.5f7^. 



ratio is therefore 



K 



^2 - AN^<^aN- 



d^x [AT^'^(x,p''"")-AT'"'(x,p'"')Y (38) 



fid/ 



.,fid^^2 



Equation ( I38t measures the extent that the parameter es- 
timator is p"''*' instead of p^'^, the true (and maximum like- 
lihood) value. S^/a^ follows x^-statistics with A^'^'' degrees 
of freedom, leading to S^/N^ = 1 ± ^JlJW^ for la errors. 
Given a true parameter set p^^, the region within 2a confi- 
dence (95%) for example is the set of p"''*' values for which 
S^/N^ < l+2^/2/N^. The uncertainty of the parameter esti- 
mation can therefore be read off directly from equation OSt . 

Figure|6lshows contour diagrams at fixed values of this un- 
certainty in Model II, assuming the fiducial parameters p^'^ = 
(z,a,x^^^,D) = (0.3,1.5,2c,0.5c) with M = lO'^^M© (upper 
panel) or M = IO'^Mq (lower panel). The contour plots al- 
low Xmax and D to vary, while all other parameters are held 
fixed at the fiducial values. 

Figure |6l depicts the likelihood contours for Model II with 
the direct evaluation of the S/a^. An estimate for these con- 
tours is provided by the Fisher information matrix. 



FijiP)^ 



d^lnL(AT°^',p)\ 



(39) 



K 



^2^^ dAT(x,p''^) dAT{x,p''^) ^^^^ 



dpi 



dpj 



Assuming that the likelihood distribution is Gaussian near 
the peak likelihood, we can use confidence limits for Gaussian 
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statistics (i.e., x^) to obtain 68% and 95% confidence regions. 
The Fisher matrix greatly speeds up the computations, and 
allows us to consider additional free parameters (Model 1). 

The minimum expected varia nce is related to the F isher ma- 
trix by the Cramer-Rao bound iTe ^mark et al.l 1997b 

(Apjf > (F-')jj (41) 

where equality holds if the distribution is well approximated 
by a Gaussian distribution. 

Tables |5] and |6l show the 68% and 95% significance un- 
certainties for Models 1 and 11, as calculated from the Fisher 
matrix. Generally, increasing the number of free variables 
can increase the uncertainty of the original variables, since 
a model with additional free parameters can better mimic 
the fiducial model for a given mock observation. However, 
our calculations indicate that this is not the case in our 
case: the uncertainties on Xmax and D are nearly the same 
in Models 1 and 11. This is perhaps unsurprising, since one 
would not expect that a change in c or AT, (the additional 
free parameters in Model 1) can mimic changes in either 
-^^max and D. Indeed, the Fisher matrix is decoupled in 
the corresponding subspaces, as the (jCjnax, AT^), (XmaxjC); 
(D,ATs), iD,c) components are negligible compared to 
the diagonal elements. The uncertainties in the parameter 
estimators are {5c/c,5ATs/ ATs,5x^a^/x^^„,5D/x^.^y) = 
(4%, 6%, 9.5%, 25%) for (Mvh.,z,a,c, A7:,,Xniax,£>) = 
(IO'^Mq, 0.3, 1.5,2.9, 1.2mK,2c,0.5c). Therefore c. A?;, 
and Xmax will be precisely obtained with ALMA for the 
majority of the clusters, while similar precision for D is 
possible for only massive clusters. We note that in principle, 
it is possible to deduce the value of a directly from the c{a) 
relation. 

Figure |5] provides confidence in our conclusions from the 
Fisher matrix method: the \a contours in {x^^^^D) are well- 
approximated by ellipses, if the fiducial values obey D- AD > 
0, e.g. D > 0.1, JCmax < l-7c, andMvir > IO^^Mq. Therefore, 
in these cases, the inequality (I41> assumes equality for the pa- 
rameter uncertainty. We find that the parameter distribution 
around D « is distorted, and higher confidence level con- 
tours for arbitrary D are banana shaped in the x„i.^^-D plane. 

Our calculations show that increasing x^ax increases the un- 
certainties Axmax and AD, while increasing D only slightly 
increases Axmax and leaves AD unchanged. It is clear that 
the choice of a does not alter the Axn,ax and AD uncertain- 
ties. Changing z yields a maximum in Ax^ax and AD around 
z = 0.4. We find that the parameter uncertainties decrease by a 
factor of 10 at z w or z w 3, compared to the z = 0.4 value. 

8. CONCLUSIONS 

We have shown that the forthcoming Atacama Large Mil- 
limeter Array (ALMA) is well-suited for studies of the 
intra-cluster medium density distribution using the Sunyaev- 
Zel'dovich (SZ) effect. The angular beam diameter and sen- 
sitivities are predicted to reach 2" and 10/iK for this system, 
which exceed present detector resolutions by more than two 
orders of magnitude. The SZ decrement profile of a rich 
galaxy cluster, observed at 100 GHz for ^30 hours, will have 
a sufficiently high signal-to-noise ratio to make inferences 
about the gas distributions (and, by inference, about the dark 
matter distributions). 

We solved the equations of hydrostatic equilibrium for the 
self-similar gas density in the dark matter background. The 
dark matter profile was parameterized by its inner slope a, and 
every calculation was evaluated for the two values common in 
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Parameter uncertainty (z = 0.3, Model II) 
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TABLE 6 

Parameter uncertainty (z = 0.3, Model I) 
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the literature, i.e. a = 1 and 1.5. Within this framework, the 
density as a function of radial distance is small but nonzero, 
even in the outer regions. As a modification, we introduced a 
linear cutoff in the density profile, where the density falls to 
zero within a finite radial distance. This assumption mimics 
the presence of strong virial shocks, predicted in cosmo log- 
ical structure form ation theories (see iBertschingeJI 198^ and 
iBagchi et alj200 2). 

We have calculated the SZ decrement profiles of clusters 
with and without the presence of a sharp density cutoff near 
the virial radius. We assumed 10/iK flat-power Gaussian 
noise. We set up the likelihood function and defined the deci- 
sion rule to see whether our fiducial model with a virial shock 
can be distinguished from a test model without a virial shock. 
It is important to emphasize that this test is practically model- 
independent: it is only weakly sensitive to the central regions, 
where cluster models tend to differ, with most of the sensitiv- 
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ity arising from the outer regions, where the various cluster 
models in the literature are very similar. We calculated the 
signal to noise ratio in two ways, with either four (Model 1) 
or two (Model II) free parameters. These parameters are the 
radial distance to the virial shock, x,^.^^, the radial thickness of 
the cutoff, D, the concentration parameter, c, and the temper- 
ature decrement scale, AT^, for Model I. For Model II, c and 
AT, were calculated from theory assuming no scatter around 
the correct values. Other parameters, such as the virial mass, 
Mvii , the redshift, z, and the logarithmic slope of the inner dark 
matter profile, a, can, in principle, be obtained from indepen- 
dent measurements, and were not varied. The signal to noise 
ratios agree to within 20% in Model I and Model II. The over- 
all value is S/N=470 for a rich cluster of virial mass lO'^M© 
and we find S/N=16 for a lower-mass cluster with 

We also considered a more realistic case, in which the pa- 
rameters of the cluster with and without a virial shock were 
fitted independently for a given measurement. We find a large 
systematic bias in the parameters c and AT^ for the model 
without a virial shock, provided that the cluster does have a 
virial shock within approximately two virial radius. Neverthe- 
less, the signal to noise ratio decreases only by 50% at most 
for detecting the virial shock. Therefore, the virial shocks re- 
main detectable forM > 1O''*M0 clusters. 

Finally, we examined the precision to which the 
parameters of the virial shock could be obtained. 
We find the typical values for the precisions for 
low-mass (lO''* M©) and rich (lO'^^ M©) clus- 
ters to be (&/c,(5Ar(0)/Ar(0),(5w/c,(5D/w) = 
(4%, 6%, 9.5%, 25%) and (0.6%, 0.3%, 0.3%, 0.9%), respec- 
tively. These results assume the cluster is at z = 0.3, and 
it is surrounded by a virial shock at twice the virial radius 
(xm-dx = 2c) that has a finite width D = 0.5c. If the shock was 
located closer in (jCmax = c) and was sharper (D = 0.01c), these 
uncertainties would improve to (3. 7%, 6.1%, 2. 8%, 12%) 
and (0.6%, 0.4%, 0.1%, 0.4%), respectively. Since there is 
a one-to-one theoretical correspondence between c and a, 
with c(a = l)/c(a = 1.5) = 1.7, SZ measurements should 
help in determining the value of a. This should serve as a 
valuable diagnostic of the inner gas profile; in particular, of 
the presence of any excess (non-gravitational) entropy that 
could strongly modify the inner profile. 

Our analysis is based on a simplified, spherically symmetric 
model for the cluster. In these models, the virial shock appears 
in the SZ surface brightness maps as a full {2tt azimuthal an- 
gle) ring-like feature. In reality, the gas-infall into the cluster 
potential should be inhomogeneous and anisotropic. As a re- 
sult, the radial location and the strength of the virial shock 
will vary in different directions awa y fro m the cluster's cen- 
ter ('Evrardl ll990t iBrvan & Norman 1998). We partially ac- 
counted for this effect by allowing the virial shock to have 
a finite width (D), which "smears out" the ring-like feature 
- mimicking the effect of projecting shocks of varying ra- 
dial locations and strengths. Simulations also show that gas 
is falling into the cluster potential along overdense filaments, 
and that the strongest shocks occur along these filaments. If 
the projected ring-like feature extended over only a fraction 
/ of the full 2tt azimuthal angle (corresponding to the pro- 
jected areas of the filaments), then our predicted S/N for de- 
tecting the virial shock would be further reduced by a factor 
1 / ^/f. The virial shock will nevertheless remain detectable 
for aM ^ 10 '^M© cluster at a significance of S'/A^ > 10, un- 
less / < lo-l 



Our results suggest that SZ decrement measurements with 
ALMA can reveal virial shocks around galaxy clusters, and 
can determine the location and size of these shocks with high 
precision. This will provide a unique constraint on theories of 
large-scale structure formation. 

We thank Gil Holder, Greg Bryan, David Helfand, and San- 
dor Molnar for useful discussions. We acknowledge sup- 
port from OTKA through grant nos. T037548, T047042, and 
T047244. 

APPENDIX A: DISTINGUISHING MODELS WITH THE OPTIMAL 
FILTER 

In this appendix, we give a formal summary of the treat- 
ment of the following general problem. Consider a system 
that obeys laws that predict a set of observables h"'^. What 
is the significance at which a false hypothesis, predicting a 
different set of observables, h"^'*', can be rejected, given the 
presence of Gaussian random noise? 

A measurement of the observable in xk distinct directions 
yields a discrete sample where fcG [1,^]. The set {ytjis an 
element of a /T-dimensional vector space and will be denoted 
by y. Similarly, let h'^'' and h"''^* denote the discrete sample of 
the hypotheses functions h^'^{x) and h^'^^^x). In our example, 
y denotes a measured SZE temperature profile, whereas h'^'' 
and h'"'*' denote the discrete temperature profiles predicted in 
models with and without a cutoff. The vectors h*''' and h"''*' 
depend on the parameters describing the cluster and the cut- 
off. We now derive the signficance (or "signal to noise ratio") 
of distinguishing between two models. 

If the real signal arriving to the detector was s(x), then the 
detector measures the data 



y = s + n 



(42) 



where n is a random variable, corresponding to the noise. Let 
us assume white Gaussian noise of variance cr^. In this case, 
the probability of detecting y, given that the incoming signal 
is s, is 



P{y\s): 



1 



(27r)^/2(a^)^ 



exp 



(y-sY 



24 



(43) 



where the arithmetic (difference and scalar product) of the K- 
dimensional vectors was used, i.e. 



(44) 



k=l 



Two hypotheses can be distinguished using P{s \ y), which 
can be obtained from P(y \ s) and the a-priori probabilities 
P{s) by applying the Bayes theorem for conditional probabili- 
ties. Most common decision rules, such as the maximum pos- 
teriori, t he Neyman-Pearson, or the minimax decisions (e.g. 
lWhalerlT97 ll) involve constraints on the likelihood ratio. 



p(y|h'-') 



P(y|hfid)- 

Substituting i43i in i45i the log-likelihood becomes 



lnL(y)=^(h" 



-hfi'»).(y- 



24 



(h" 



-h«'^)2 



(45) 



(46) 



The decision is made in favor of the test hypothesis if the 
likelihood ratio exceeds a given threshold. The likelihood de- 
pends on the y-h*''' component along h'^'-h'^''. The term 
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jjtest jjfid referred to as the matched filter, which is the op- 
timal filter for the distinction of the two hypotheses in white 
Gaussian noise. 

Since the noise distribution P(n) is spherically symmetric 
in the K-dimensional vector space, it has the same power cr|, 
along any basis. This implies that the noise power for the 
likelihood detection rule is the noise power for a single bin, 
i.e. A^^ = crjf. Thus increasing the sample size increases the 
signal power, but leaves the relevant noise contribution the 
same. 

The signal power is 

-test Lfid\2 \ ^//^test ^fid\2 ^ f j2. 



(h h ) 



k 



Ax2 



where Ah(x) = h^'^^\x) - h^'^{x) and Ax^ denotes the area en- 
closed by the neighboring x^ points. We have assumed that 
the resolution of the measurement is fine enough to approxi- 
mate the sum with the integral. Therefore, the signal to noise 
ratio is 

52 ^ (^test_^fid)2 



(48) 



The signal to noise ratio is an important measure of the sig- 
nificance of the test. Let us define p as a one-dimensional 
random variable by 



p(y) = 



(jjtest_jjtid-).(y_jjlld) 

||h"=st-hfid||crN 



(49) 



for which P(p\s = h^"^) is a standard normal distribution, 
whereas P{p \ s = h"^'*') is a normal distribution with mean S/N 
and unit variance. Thus, if the fiducial model is true, fol- 
lows statistics, and the significance of the test can be ob- 
tained by comparing S^/N^ with the confidence levels. 

In our case, for a detector with angular resolution Acf) and a 
cluster of apparent angular virial radius Q the surface element 
is Ax^ = A/K, where K = (jcj^|'^e)^/(cA(/))^ is the total num- 
ber of pixels and A = ttx\ 
Therefore, 



l^l'x is the surface area of the cluster. 



e 



//d=,[,....,,)-*"WP. (50, 



is the signal to noise ratio for the decision between the two 
hypotheses. 

APPENDIX B: DISTINGUISHING MODELS WITH FREE 
PARAMETERS 

Here we address the following problem, which is a gener- 
alized version of the problem posed in Appendix A above. 
Consider again a system that obeys laws that predict a set of 
observables h'^'^. What is the significance at which a false hy- 
pothesis, predicting a different set of observables, h"^^', can be 
rejected, given the presence of a Gaussian random noise, ;/ 
the parameters of the false test model are not known a priori 
and can be freely adjusted! 

Assume that the original signal (i.e. without noise) is h*''' 
with a set of A^'^'' parameters, p^^, and the false hypotheses is 
described by A^"^^' parameters, p"^'*'. The measurement 



y = hflVo') + n 



(51) 



can be used to give an estimate of p\^. As in Appendix A, 
y is the collection of K observables (e.g. the SZ brightness 



of the K pixels), and n is a /T-dimensional Gaussian random 
variable. Denote the estimated parameter of h'"'^ by p^'^ and 
the estimated parameter of h"^^' by /j"^^'. Once the parameters 
have been obtained, h^^ip^^) and h"^^'(/?"^'*') can be fixed at 
the corresponding values. Then the likelihood ratio for fixed 
parameters can be used, according to equation ( I46> . and the 
decision is made in favor of the test model rather than the fidu- 
cial model exactly if the likelihood is above a given threshold. 

Since p^'^ is obtained by minimizing (y-}n^'^(p^'^yf-, the dis- 
tribution of h^'^ip^'^) from (jsT) is 

h«V') = h«W) + A,, (52) 
where p^^ is the real parameter of the signal without the noise 
and A I is an A^*''' dimensional Gaussian random variable. ^ 



(•47-) Ai has zero mean and variance N^'^aj/. Similarly, /?'' 



obtained by minimizing 

(y-h"='*'(/?"''))^- The distribution of 

jjtest(^test) 

h'^^'(p"=^') = h"=^'(pr) + A2 (53) 
where p^'^^ is the p'"*' parameter value at which {h^'^ipQ'^) - 
jjtest^^test-)-)2 minimal. A2 is defined by equation (I53> . which 
is an A^"^^'-dimensional Gaussian random variable, with zero 
mean and variance A^^^^'cr^. Note that Ai and A2 are strongly 
correlated since both values are derived from the same mea- 
surement y. 
The signal power 

^2^(jjtest_^fid-)2 (54) 

is therefore a random variable. Let us define the empirical, 
most probable^ and expected signal powers by 

5Lp = (h'^^'(y'=^')-h«V'^))2 (55) 
Sl = (h'<''\p'^'')-h^\pl'^)f = min(h"=^'(/?"'')-h'"'(p||'^))^ 

(56) 



(57) 



The expected signal power can be written in terms of So 
using eqs. (15 2> and i53\ 

5e\p = {[h'''\pr')-h^Hpl'') + A2-Aif) (58) 
= Sl + {(A2-Aif) (59) 
= Sl + {Al) + {Ai)-2{AiA2) (60) 
= 5,^ + (A^«'^+A^"=Vw-2(AiA2). (61) 

In particular if A^'''^ = A^"='*' = 1, then (A1A2) = cos{<j))al/2. 

This term can be obtained, for given h*''' and h"'^' hypothesis 

functions by 



COS((/)) = 



dh™(piJ'') 
dpj" 


dh'^^'C/jlf'') 

dp;- 


dh"''(;;J,"') 




dh""(Po") 









(62) 



For arbitrary numbers of parameters, the expected value of 
(A1A2) has a more complicated algebraic form, but can al- 
ways be obtained for a specific choice of h'''' and h"^^' func- 
tions. For simplicity we shall restrict only to the inequality 



(A1A2) >0, 



(63) 



^ Note that A 1 is a Gaussian random variable in terms of the set of param- 
eters p*"^ only if the various choices for these parameters span a flat (linear) 
subspace. This is an adequate approximation, provided that the radius of 
curvature is much less than the rm.s. variance of the parameters, (\/N^<jn). 

^ The estimated parameters have a Gaussian distribution around pjj'' and 
Pq^'. The term "most probable" refers to the parameter distribution not the 
signal power distribution. 
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leading to 



Sl<Si<Sl + iN''' + Nn4- 



(64) 



The noise power for choosing between h'^'' and h"''^* (with an 
arbitrary p^'^ or p"^'*') is the variance of the measurement along 
jjtest(-^test-)_j,fid(-^^fid-)^ since the HkeHhood ratio (|46} depends 
on only this component. Therefore 

jjtest(^test-)_jjfid(^fid) -'2\ 



exp 



(y-hfiV')) 



= 4+ 



Jjtest^ptest^ _ Jjfid^pfid 

^test(^test)_^fid(^fid) 
|ljtest(^test)_ljfid(^fid) 



■•A, 



(65) 
(66) 



For large signal to noise ratios this can be approximated to 
lowest order in Ai. 



exp 



^test(^test)_^t,d(^^ljd) 
|ljtest(^test)_^iid(^& 



•A, 



= 4+((cos0)2)a 



where 9 is the angle between the vector h"^^'(/:'y - h' 
and h^'^(p'^'^)-h^'^(pl'^). In general for any (hfi^h"='') 



(67) 



(68) 
fid(^fid) 



cTl<Nip<2al 



—^''exp — ^'^N- (69) 

Equation ( I69> is a natural consequence of the fact that there 
are two sources of uncertainties for the hypothesis test, corre- 
sponding to the uncertain estimates of the parameters of the 
fiducial and test models. In principle, these uncertainties can 
add up constructively to create a 2cr^ variance in the likeli- 
hood ratio, but the noise power is at least the noise of the 
fixed models of Appendix A, lcr|,. 

In particular, for the SZE brightness hypotheses, the test 
model is a special case of the fiducial model, with the fiducial 
model having two additional parameters describing the cutoff 
at the virial shock. In this case for A^*''' = 4 and A^"^^' = 2, it can 
be shown that ((cos 6)^) w 1/4 independent of p^'^. Therefore 



(70) 



Although the expected signal to noise ratio can be cal- 
culated explicitly for a given parameter choice using equa- 
tion ( I6H and ilOi . it is useful to define its the theoretical 
bounds independent of the given form of the hypotheses. 
Comparing equations (I64> and (I69> . we obtain 
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< 



W 

'exp 



< 4+A?^''+V 



(71) 



where So is given by equation (I56> . The high bound is ap- 
proached when //"^*' tends to be parallel to H^'^, the low bound 
is approached when //"^^' is orthogonal to H^'^ near H^'^ip^'^) 
and H^'^^Xpq''^). Here H denotes the sub-space of the K- 
dimensional vector space that is spanned by the variable pa- 
rameters. 

APPENDIX C: PARAMETER BIAS FOR A FALSE HYPOTHESIS 

Here we present an estimate for the values of the best- 
fitting parameters of a false hypothesis. This problem can be 
contrasted with the application of the Fisher matrix. While 
the latter is a method to obtain the variance on the parameters 
of the true model, here we seek an approximation of the ex- 
pectation values for the best-fitting parameters of a false test 



model (regardless of the uncertainties around this best fit, and 
whether the best is an acceptable fit). 

Finding py'*', the minimum value in the p"^'*' parameter 
space in general can be computationally tedious. Our treat- 
ment is general as long as the fiducial hypotheses has all of 
the parameters of the test hypotheses plus some additional pa- 
rameters and if the two hypotheses deviate by only a reason- 
ably small amount. In particular, as displayed in the panels of 
Figure|5] the numerical value obtained naively by 

^test(O) ^ ( AT^test^ ^test) ^ ( A^/^ c"'^) (72) 

is a fair zeroth approximation for the two cluster models in 
most cases. A better approximation can be obtained by ex- 
panding [h"^*'(/?"^*')-h'^'*(/?*''')]2 to second order around 
and finding its minimum in terms of p"^^'. 

(jjtest(^test)_^fid(^fid))2_ 

(h^^t - h'^'')^ + 2(9ih"=^') • (h'"^' - h^^)xk (73) 
+ [{djh'''') ■ (dkh'"'') + djdkh''^'' ■ (h"=^' - h^'^)]xjXk 

where it is assumed that h"''*' and h*''' are evaluated at /Jq^''"^ 
and p^'^, respectively. The indices j and k run across the test 
model parameters, andxj; = /'o^'^'^-/'o^'^"\ furthermore 



k 

d 



Let us denote the quadratic coefficient by 



Mkj = idkh'''') ■ (9;h"=^') + (h"=^'-hfi'') • dkdjti 



(74) 



(75) 



Equation ( I73l l gives the deviation of the signal to noise ratio if 
the test model parameters are modified from the fiducial val- 
ues. The expected signal to noise corresponds to the minimum 
value in terms of Xk, at which the Xk derivative must vanish 



_d_ 

dxk 



[/^test(^test)_^hd(^hd)]2 



(76) 



2((9th"=^') • (h'"'-h*"') + 2M*yX;- = 0. 



Since Xk = /'o^'^ ' -/'o^T ' '■^^ solution of this equation gives 
the next approximation of pg^'. Equation M6\ is solved by 
inverting the coefficient matrix. Thus 



„test(l) 

Pa k '■ 



„test(0) 

■Pq k ' 



M^](a^h"=^')-(h" 



-hfi<»), 



(77) 



Po*'f ^ is the unproved approximation of the ^* parameter 
of Note that the Al^-dependence of the hypotheses 

is linear, implying dATM'''\p''''') = h"=^'(c'"', Arf^' = 1) and 
h''''*'(/?"'^') = for all p"'^'. The c"''*' parameter derivatives 
are to be calculated numerically using equation J21l i. In con- 
clusion, finding the minimum in equation i35\ is simplified to 
evaluating parameter derivatives of the h"^^' function at only 

„test(0) 

Pq 

Figure |5l shows how the best fitting ^' parameters are 
related to the naive choice p^'^ when detecting virializa- 
tion shocks with the SZ effect. The S/N function has a 
minimum at p^^^K Finding the critical point with equa- 
tion ilH yields the correct global minimum for (Mvir,z,Q;) = 
(1O'^M0,O.3, 1.5)forany (x„,ax,£>) and (10'4Mq,0.3, 1.5) for 
any (xmax,£>) unless x^ax ~ c. However, for (Mvir,z,Q;,Xniax) ~ 
(1O''*M0,O.3, 1.5,c) there are multiple local minima and sad- 
dle points. The /?g = '"''^ approximation of equation J77> 



Virial Shock Detection in Galaxy Clusters with the Sunyaev-Zel'dovich Effect 



15 



yields the local minima in the vicinity, which is not the 
global minimum. For (Mvir,z,a) « (1O'^M0,O.3, 1.5) and ai"- 
bitrary (jCmaxj^)), the Pg^'*^'' value corresponds to the saddle 
point. Again, the = approximation of equation illi 

breaks down. 

The saddle points can be identified by calculating detM,j 
, ,^ / /es/N > iff p''^"' is at a local minimum or maximum 

(78) 

The saddle point can be eluded if detM^y is calculated and 

a step is made towards only if detM^^y > 0. If it is 

negative, then a step is made "downhill" along the negative 



gradient -2(a*h"=^') • (h"=^'-h«'^). 

Repeating equation J77> or the downhill steps gives a 
numerical efficient way of obtaining better and better ap- 
proximations of the best fit test parameter values near the 
corresponding fiducial parameters. Further study shows 
that this algorithm leads to the correct p^^^'^^ value in most 
cases. Only the fiducial parameters around p = (Mvir,z, a) w 
(1014Mo,0.3, 1.5), with (xn,ax,£>) = (c, 0.01c) or (c,0.5c) have 
non-global minimum in the close neighborhood of Pq'^. In 
this case the best fitting test model parameters have to be ob- 
tained by evaluating equation i56\ at multiple parameter val- 
ues, scanning the parameter space. 
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